Device and method for photo and video capture

ABSTRACT

A single user input element in an image capture device is used for both photo and video capture. Based on a first user interface activity, a timing reference is engaged at a first reference time. In a first case, photo capture is performed. In the first case, a second reference time is based on a second user interface activity, and the timing reference indicates a passage of time between the two reference times is shorter than a particular time interval. In a second case, video capture is performed. In the second case, the timing reference indicates a passage of time since the first reference time is equal to or longer than the particular time interval. Video capture is stopped based on a subsequent user interface activity. The user interface activities may comprise detecting actions based on the same type of physical manipulation of the single user input.

FIELD

This relates to devices and methods for photo and video capture with a single user input element. The same single user input element in an image capture device is used in performing both photo capture and video capture.

BACKGROUND

Image capture devices with the ability to capture both static content, such as still images, and dynamic content, such as moving images embodied as video or audiovisual content, have become widely available. Historically, still cameras for still images were distinct in design and functionality from video cameras for moving images. For instance, there has been a long history of some cameras designed for photography and other cameras designed for cinematography. Technological developments in image capture devices, however, has enabled the proliferation of cameras with the ability to capture both still images and moving images.

Even if the same camera has the ability to capture both static and dynamic image content, the camera's user interface must be able to distinguish between the user's instruction for still image capture and the user's instruction for moving image capture. Various techniques have become conventionally known.

One technique is to use multiple buttons, each button assigned its own separate function. One button may be assigned to initiate photo capture. Another button may be assigned to initiate video capture. Commercial shoot-and-point cameras have been known to employ this technique of a separate photo button and a separate video button. This technique can be disadvantageous in a number of ways. For example, multiple buttons may require more hardware and its associated costs in design and manufacturing, as compared to a single button. As another example, multiple buttons may occupy extra space on the form factor of the camera device. As many modern camera devices are small in size for convenience and portability, there is limited space for user input elements, such as buttons.

Another set of conventional techniques is directed to using a single capture button with multiple capture functions. A first example employs a separate mode selection button to toggle between photo capture mode and video capture mode. In photo capture mode, operating the single capture button initiates photo capture. In video capture mode, operating the same single capture button initiates video capture. Handheld smartphones have been known to employ this technique of a separate mode selection button to toggle between photo capture mode and video capture mode. This technique can be disadvantageous because an extra user operation is required to switch between these two modes. Specifically, before the user operation of pressing the capture button, the extra user operation of pressing the mode selection button and the subsequent mode change are required. This extra user operation and mode change introduces a noticeably long time delay before the user can press the capture button. During that long time delay, the user may miss the opportunity to capture the desired photo or video.

A second example employs a single user input element that functions both as a mode selection input and as a capture button of a camera, as shown in U.S. Patent Application Publication No. 2004/0061788 A1 to Bateman. Specifically, each of these two functions is implemented by a different physical manipulation of the single user input element. For the function of mode selection, the physical manipulation of sliding the single user input element selects the different capture modes of the camera. For the function of capture, the physical manipulation of pressing the single user input element initiates capture.

Different modes of the camera are selected by a user operation of sliding the single user input element between two different slide positions. In a first slide position, the camera is in still image capture mode, and pressing the single user input element initiates still image capture. In a second slide position, the camera is in video capture mode, and pressing the single user input element initiates video capture.

The technique of this second example can be disadvantageous in a number of ways. First, an extra physical manipulation of a different type is required to switch between these two modes. Specifically, before the physical manipulation of pressing the single user input element, the extra physical manipulation of sliding the single user input element is required. Pressing and sliding are different types of physical manipulation. This extra physical manipulation of a different type introduces a noticeably long time delay before the user can press the single user input element. During that long time delay, the user may miss the opportunity to capture the desired photo or video. Additionally, accommodating multiple sliding positions may require more hardware and its associated costs in design and manufacturing, as compared to a single button occupying a single location. Furthermore, multiple sliding positions may occupy extra space on the form factor of the camera device.

SUMMARY

Various embodiments involve an image capture apparatus comprising an image sensor, a timing reference, a single user input element, and a processor block configured to control a method for photo capture and video capture. Other embodiments involve a machine-readable medium for an image capture device directed to a method for photo capture and video capture. A method for photo capture and video capture involves steps for performing photo capture and steps for performing video capture.

Some embodiments involve engaging a timing reference at a first reference time based on a first user interface activity at the single user input element. In a first case, a second reference time is based on a second user interface activity at the single user input element, the timing reference indicates a passage of time between the two reference times is shorter than a particular time interval, and still image capture is performed. In a second case, the timing reference indicates a passage of time since the first reference time is equal to or longer than the particular time interval, and video capture is started. In the second case, video capture is stopped based on a subsequent user interface activity at the single user input element. The reference time for a corresponding user interface activity may be equal to or later than the time when the corresponding user interface activity occurs at the single user input element.

In some embodiments, the steps for performing photo capture involve engaging a timing reference at two reference times based on two user interface activities at the single user input element. After determining that a timing reference indicates that a passage of time between the two reference times is shorter than a particular time interval, still image capture is performed based on the determination. The reference times may be equal to or later than the two times when the two user interface activities occur at the single user input element.

In some embodiments, the steps for performing video capture involve engaging a timing reference at a first reference time based on a first user interface activity at the single user input element. After determining that a timing reference indicates a passage of time since the first reference time is equal to or longer than the particular time interval, video capture is started based on the determination. Based on another user interface activity at the single user input element, the video capture is stopped. The first reference time may be equal to or later than the time when the first user interface activity occurs at the single user input element.

Implementing the single user input element with the time-based approach disclosed herein distinguishes from other approaches based on multiple buttons and multiple types of physical manipulation. Comparative advantages may include reduced hardware and its associated costs in design and manufacturing, increase in available space on the form factor of a camera device, removal of the noticeably long time delay of alternating between multiple modes and multiple user operations, and increased simplicity in user interface experience.

In some embodiments, all of the user interface activities at the single user input element comprise detecting actions based on the same type of physical manipulation of the single user input. The corresponding user actions on the single user input all belong to the same type of physical manipulation. Compared to the known use of multiple types of physical manipulation of a single user input element, the aspect of using the same single type of physical manipulation can lead to further advantages, such as removal of the noticeably long time delay of alternating between multiple types of physical manipulations, further reduced hardware and its associated costs in design and manufacturing a single user input element that operates via multiple types of physical manipulation, further increase in available space on the form factor of a camera device, and further increased simplicity in user interface experience.

In some embodiments, some user interface activities involve detecting press actions on the single user input element, and other user interface activities involve detecting release actions from the single user input element. Employing press and release actions may provide a convenient user experience.

In some embodiments, some user interface activities involve a change in the state of the single user input element from OFF to ON, and other user interface activities involve a change in the state of the single user input element from ON to OFF. Employing ON and OFF states may provide a convenient implementation of a single user input element.

In some embodiments, photo capture and video capture employ the same image sensor. Employing the same image sensor may provide benefits in reduced hardware costs and increased device portability.

In some embodiments, the single user input element is one of a button, a key, a screen area of an input display, and a touch-sensitive area of a touchscreen. Such various examples indicate that the single user input element may be embodied in any suitable form in accordance with the teachings disclosed herein.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram of an overall system design in a preferred embodiment.

FIG. 2 illustrates an embodiment in an idle status.

FIG. 3 illustrates an embodiment in connection with photo capture.

FIG. 4 illustrates an embodiment in an idle status after photo capture.

FIG. 5 illustrates an embodiment in connection with video capture.

FIG. 6 illustrates an embodiment in an idle status after video capture.

FIG. 7 illustrates exemplary user activity.

FIG. 8 shows a temporal sequence of user events related to photo capture.

FIG. 9 shows a temporal sequence of user events related to video capture start.

FIG. 10 shows a temporal sequence of user events related to video capture stop.

FIG. 11 shows an exemplary operational diagram.

FIG. 12 shows an exemplary hardware embodiment.

DETAILED DESCRIPTION

In the following description of preferred embodiments, reference is made to the accompanying drawings which form a part hereof, and in which it is shown by way of illustration specific embodiments in which the invention can be practiced. It is to be understood that other embodiments can be used and structural changes can be made without departing from the scope of the embodiments of this invention.

Overall System Design of the Preferred Embodiment

FIG. 1 is a block diagram of an overall system design in a preferred embodiment 100. Embodiment 100 may be implemented with any device configured to capture, process, store, or transmit (or any combination of the above steps) both photos and videos utilizing image sensors, such as image sensor 140. Such a device may also utilize audio sensors, such as audio sensor 130.

A user may issue commands to embodiment 100 to effectuate the capture of photo and/or video content. The user may be external to embodiment 100.

Control block 110 may represent the sequence of interactions between the user and embodiment 100 resulting in control commands and leading to the capture of photo and/or video content. Control block 110 may be implemented with central processing unit (CPU) 111, bus 112, and timing reference 114. CPU 111 may comprise one or more processors. Bus 112 may comprise one or more buses interconnecting the various components of embodiment 100. Timing reference 114 may be embodied as any way or component involved with timing references, such as one or more timers, one or more clocks, a combination of timer(s) and clock(s), and any process or component that provides the functionality of a clock or timer. Using or engaging timing reference 114 includes various timing-related activities, such as indicating event times, starting timers, reading clocks or timers, indicating passage of time, setting or registering reference times, etc.

User interface 120 is configured to intermediate the interactions between the embodiment 100 and the user. Via user interface 120, the user may effectuate the execution of the processes of control block 110. User interface 120 may also provide feedback about the results of various processes. User interface 120 may be implemented with visual, acoustic, tactile, or proximity elements (or any combination of the above elements). As a result of user interaction with user interface 120, control block 110 may generate a control command, such as a signal that affects the rest of the system of embodiment 100 (e.g., a Capture Photo signal, a Start Video signal, a Stop Video signal).

Audio sensor 130 registers acoustic input data and may transform acoustic input data into electrical audio signals for further processing. The acoustic input data can be processed based on the commands from control block 110, in connection with producing photo or video content. Audio sensor 130 may represent one or more audio sensors.

Image sensor 140 registers visual input data and transforming visual input data into electrical image signals for further processing. The visual input data can be processed based on commands from control block 110, in connection with producing photo or video content. Image sensor 140 may represent one or more image sensors.

Display 150 can provide a real-time visual display of the image signals generated by image sensor 140. Display 150 may be structurally integrated with embodiment 100 as a unitary device.

Speaker 160 is an audio reproduction device used to produce audible output, such as audible feedback. Speaker 160 may be structurally integrated with embodiment 100 as a unitary device.

Video processing 170 represents a set of transformations and the equipment and control processes involved to produce video content based on the image signal generated by the image sensor 140. The video content may be produced based also on the audio signal generated by audio sensor 130. The video content is a dynamic sequence of images captured by image sensor 140, which may be simultaneously captured with synchronized audio captured by audio sensor 130. The video content may be processed and formatted for storage, subsequent retrieval, and transmission.

Photo processing 180 represents a set of transformations and the equipment and control processes involved to produce photo content from the image signal generated by image sensor 140. The photo content may be produced based also on the audio signal generated by audio sensor 130. The photo content is a static image (a momentary representation) captured by image sensor 140, which may be simultaneously captured with synchronized audio captured by audio sensor 130. The photo content may be processed and formatted for storage, subsequent retrieval, and transmission.

Storage 190 may store video content from video processing 170. Storage 190 may store photo content from photo processing 180. Storage 190 may be embodied as any suitable storage medium for storing video content or photo content, including the non-limiting examples of tape storage on magnetic tape, disk storage on a hard disk drive, flash memory of solid-state semiconductor memory.

Embodiment 100 in FIG. 1 represents relevant processes and components. These relevant processes and components may be implemented by any suitable hardware configuration. For example, video processing 170 and photo processing 180 are illustrated as separate blocks but may be described together as image processing 175. Video processing 170 and photo processing 180 may be implemented respectively as a separate video processor component and a separate photo processor component. Alternatively, they may be embodied as a single image processor component corresponding to image processing 175. Additionally, any combination of video processing 170, photo processing 180, image processing 175, and CPU 111 may be implemented as one or more processors. Some or all of the elements shown in control block 110 and image processing 175 may be embodied in a single integral structure or a single collection of components, as shown by processing block 115.

FIG. 12 illustrates a hardware embodiment 200 for implementing design embodiment 100. The components in FIG. 12 have reference characters that are numbered in a corresponding fashion to the reference characters in FIG. 1. Hardware embodiment 200 may be implemented in devices including, but not limited to, a portable camera phone, smartphone, digital camcorder, digital camera, tablet computer, laptop computer, desktop personal computer (PC), personal digital assistant (PDA), media player, and a combination of two or more of these devices.

Control block 210 may be implemented with CPU 211 and timing reference 214. Bus 112 may connect control block 210 to image processor 275. Image processor 275 may comprise video processor 270 and photo processor 280. Methods and processes disclosed herein may be embodied as instructions stored on a machine-readable storage medium, such as random access memory (RAM) 216 or read-only memory (ROM) 217. One or more processors, such as CPU 211, may execute these instructions to cause embodiment 100 to perform the methods and processes disclosed herein. Some or all of the elements shown in control block 210 and image processor 275 may be embodied in a single integral structure or a single collection of components, as shown by processing block 215.

User interface 220 may be embodied as a single user input element, such as a button, a key, a screen area of an input display, a touch-sensitive area of a touchscreen, etc. Audio sensor 230 may be embodied as a microphone. Image sensor 240 may be embodied as one or more image sensors. Display 250 may be embodied as a preview display for image signals from image sensor 140. Display 250 may be structurally separate from user interface 220 or structurally integrated with user interface 220, such as a touchscreen as an input/output device. Speaker 260 may be embodied as one or more speakers or one or more audio output ports to an audio peripheral, e.g., headphones. Storage 290 may be embodied as any suitable storage medium for storing video content or photo content, including the non-limiting examples of tape storage on magnetic tape, disk storage on a hard disk drive, and flash memory of solid-state semiconductor memory. Network interface 205 may be embodied as any suitable connection (e.g., physical data port, virtual data port, infrared connection, wireless radio frequency (RF) connection, etc.) to a communications network (e.g., a local area network, a storage area network, the Internet, cellular data network, etc.) via a propagation medium (e.g., electrical wires, optical fiber, acoustic space, wireless signals, RF signals, etc.). Data may enter or leave embodiment 200 via network interface 205, including program instructions for RAM 216 or ROM 217 and including photo or video data for storage 290. The various components of embodiment 200 may be interconnected by any suitable arrangement of interconnections, as represented by bus 112.

User Interface and Actions

User interface 120 comprises a single actionable user input element 121 (e.g., a button, a key, a screen area of an input display, a touch-sensitive area of a touchscreen, etc.). Element 121 can indicate two states ON and OFF. Element 121 may be in the OFF state. The user can change the state of element 121 to ON by applying a first action (e.g., button press, screen touch). Element 121 can indicate the ON state for a duration of time. For instance, the user may apply the first action during the duration of time. The user can change the state of element 121 to OFF by applying a second action (e.g., button release, screen touch release). For instance, the second action may be ceasing the first action so that the first action is no longer applied to element 121, returning element 121 to the OFF state.

The first action and the second action on the single user input element 121 may belong to the same type of physical manipulation. For example, the first action and the second action may involve a pressure type of physical manipulation, as in applying pressure to a button and releasing pressure from the button. As another example, the first action and the second action may involve a contact type of physical manipulation, as in contacting a touch-sensitive surface and removing contact from the touch-sensitive surface. As a further example, the first action and the second action may involve a proximity type of physical manipulation, as in entering a proximity detection region and exiting a proximity detection region.

Hereinafter, the user interaction leading to the state of actionable element 121 switching from OFF to ON is referred to as a “PRESS” action, and the user interaction leading to state of actionable element 121 switching from ON to OFF is referred to as a “RELEASE” action. The time elapsed between a PRESS action and a subsequent RELEASE action is referred to as “HOLD time.” During HOLD time, the user may perform a “HOLD action” by continuing to apply the PRESS action to element 121. Embodiments are not limited to a pressing action and a releasing action but may also include other actions.

FIG. 7 illustrates exemplary user activity. PRESS action 124 shows the action of a user moving a finger into contact with a control surface. RELEASE action 126 shows the action of a user moving a finger out of contact with a control surface. FIG. 7 also shows an idle status 121 where a user is idling and not contacting a control surface.

Hereinafter, a predetermined duration of time is referred to as “Interval X.” The duration of Interval X may be a plurality of seconds, a second, or a fraction of a second. The relevance of Interval X is further explained below.

Idle

FIG. 2 illustrates embodiment 100 in an idle status. Image sensor 140 is connected to display 150, as shown by connection 142 via bus 112. Display 150 may show a real-time preview of image signals generated by image sensor 140. User input element 121 is in the OFF state.

Photo Capture

FIG. 8 shows a temporal sequence of user events related to photo capture. In order to perform photo capture, a user may execute the following sequence of stages:

Stage P1. While embodiment 100 is in an Idle status (as in FIG. 2), the user changes the state of element 121 to ON. As exemplified in FIG. 8, the user performs a PRESS action 124 on user input element 121 at time TA.

Stage P2. The state of element 121 remains ON for a period of time shorter than Interval X, e.g., the user performs a HOLD action for a HOLD time less than Interval X.

Stage P3. The user changes the state of element 121 to OFF. As exemplified in FIG. 8, the user performs a RELEASE action 126 on user input element 121 at time TB. The time difference between time TB and time TA is less than Interval X. When the state of element 121 is OFF, embodiment 100 performs photo capture at time TB or later.

Photo capture may also be described in terms of embodiment 100 and FIGS. 2-4. During stages P1-P3, embodiment 100 progresses through the statuses shown in FIGS. 2-4. The idle status of FIG. 2 is described above.

FIG. 3 illustrates embodiment 100 in connection with photo capture. User activity 122 represents the combination of PRESS action 124 and RELEASE action 126 of FIG. 8. Image sensor 140 is connected to display 150, as shown by connection 142 via bus 112. Display 150 may show a real-time preview of image signals generated by image sensor 140. Image sensor 140 is also connected to photo processing 180, as shown by connection 144 via bus 112. Timing reference 114 is referenced to determine that user input element 121 remains ON for a period of time shorter than Interval X. Photo processing 180 captures initial image 182. Photo processing 180 may further transform or process initial image 182 into final image 192 for storage and/or transmission. Photo processing 180 may store final image 192 into storage 190 and may transmit final image 192 to another destination.

FIG. 4 illustrates embodiment 100 in an idle status after photo capture. This idle status of FIG. 4 may correspond to the idle status of FIG. 2, as discussed above. In this idle status of FIG. 4, storage 190 may contain final image 192.

Photo capture can be enhanced to provide feedback to the user about the progress and the final result of stages P1-P3. Audio feedback enhancements may utilize speaker 160. A distinct audio signal can indicate that photo capture has been triggered. A distinct audio signal can indicate that photo capture has been triggered and successfully completed. A distinct audio signal can indicate that photo capture has been triggered and failed.

Visual feedback enhancements may utilize display 150. A distinct visual signal can indicate that photo capture has been triggered. A distinct visual signal can indicate that photo capture has been triggered and successfully completed. A distinct visual signal can indicate that photo capture has been triggered and failed.

Video Capture

FIG. 9 shows a temporal sequence of user events related to video capture start. FIG. 10 shows a temporal sequence of user events related to video capture stop. In order to perform video capture, a user operator may execute the following sequence of stages:

Stage V1. While embodiment 100 is in an Idle status (as in FIG. 2), the user changes the state of element 121 to ON. As exemplified in FIG. 9, the user performs a PRESS action 124 on user input element 121 at time TA.

Stage V2. The state of element 121 remains ON for a period of time equal to or longer than Interval X, e.g., the user performs a HOLD action for a HOLD time equal to or greater than Interval X. When element 121 remains ON for a period of time equal to or longer than Interval X, embodiment 100 initiates the recording of video content, as reflected by start recording event 174 at preset time TS or later in FIG. 9. The time difference between time TS and TA is equal to or greater than Interval X.

Stage V3. The user changes the state of element 121 to OFF. As exemplified in FIG. 10, the user performs a RELEASE action 128 on user input element 121 at time TC. When the state of element 121 is OFF, embodiment 100 stops the recording of video content at time TC or later.

Video capture may also be described in terms of embodiment 100 and FIGS. 2 and 5-6. During stages V1-V3, embodiment 100 progresses through the statuses shown in FIGS. 2 and 5-6. The idle status of FIG. 2 is described above.

FIG. 5 illustrates embodiment 100 in connection with video capture. User activities 124 and 128 respectively correspond to PRESS action 124 and RELEASE action 128 of FIG. 10. Image sensor 140 is connected to display 150, as shown by connection 142 via bus 112. Display 150 may show a real-time preview of image signals generated by image sensor 140. Image sensor 140 is also connected to video processing 170, as shown by connection 146 via bus 112. Timing reference 114 is engaged to determine that user input element 121 remains ON for a period of time equal to or longer than Interval X. In order to capture initial video 172, video processing 170 captures video data from image sensor 140 via connection 146 and may also capture audio data from audio sensor 130 by connection 136 via bus 112. Video processing 170 may further transform or process initial video 172 into final video 194 for storage and/or transmission. Video processing 170 may store final video 194 into storage 190 and may transmit final video 194 to another destination.

FIG. 6 illustrates embodiment 100 in an idle status after photo capture. This idle status of FIG. 6 may correspond to the idle status of FIG. 2, as discussed above. In this idle status of FIG. 6, storage 190 may contain final video 194.

Video capture can be enhanced to provide feedback to the user about the progress and the final result of stages V1-V3. Audio feedback enhancements may utilize speaker 160. A distinct audio signal can indicate that video recording has been triggered. A distinct audio signal can indicate that video recording has been triggered and successfully completed. A distinct audio signal can indicate that video recording has been triggered and failed.

Visual feedback enhancements may utilize display 150. A distinct visual signal can indicate that video recording has been triggered. A distinct visual signal can indicate the current duration of recording. A distinct visual signal can indicate that video recording has been successfully completed. A distinct visual signal can indicate that video recording has failed.

Operational Diagram of the Preferred Embodiment

The methods and processes disclosed herein may also be described in terms of operational diagrams. FIG. 11 shows an exemplary operational diagram in accordance with the preferred embodiment of the present invention. The left side of FIG. 11 shows activity at user interface 120 of embodiment 100. The right side of FIG. 11 shows activity of other portions of the system of embodiment 100.

State S1 is an idle state of embodiment 100. Image sensor 140 may be operating. Display 150 may be showing a live preview of image signals generated by image sensor 140. User input element 121 is in the OFF state. Optionally, display 150 may show visual cue(s) on or around a control surface of user interface 120, indicating the OFF state of element 121 and prompting the user to perform an action (e.g., a prompt for a PRESS action).

State S2 is a timing start state of embodiment 100. When the state of element 121 changes to ON, embodiment 100 transitions from idle state S1 to timing start state S2 via activity U12. User interface activity U12 is exemplified as detecting a user interacting with the control surface of user interface 120 via a PRESS action 124. Optionally, display 150 may show visual cue(s) on or around the control surface of user interface 120, indicating the ON state of element 121.

In timing start state S2, embodiment 100 may engage timing reference 114 to register a time TA, which is the time when the state of element 121 changes to ON or a later point in time. Timing reference 114 may be engaged to start an interval timer or read an interval clock. By engaging timing reference 114, embodiment 100 determines whether the state of element 121 remains ON for a period of time shorter than Interval X or whether the state of element 121 remains ON for a period of time equal to or longer than Interval X. In the former case, embodiment 100 transitions to capture photo state S3. (For example, the interval timer or clock indicates time TB when the state of element 121 changes to OFF, and the interval timer or clock indicates a passage of time less than Interval X.) In the latter case, embodiment transitions to start video recording state S4. (For example, the interval timer or clock indicates a passage of time equal to or greater than Interval X.)

State S3 is a capture photo state of embodiment 100. Embodiment 100 can transition from timing start state S2 to capture photo state S3 via activity U23. Based on user interface activity U23, the state of element 121 changes to OFF. User interface activity U23 is exemplified as detecting the user interacting with the control surface of user interface 120 via a RELEASE action 126 at a time point within Interval X. Photo capture includes capturing an output image signal of image sensor 140 to provide captured photo data. Photo processing 180 processes the captured photo data for storage or transmission. Optionally, photo processing 180 may store the processed photo data into storage 190 for persistence. Optionally, embodiment 100 may provide visual and/or audio cue(s) indicating that photo capture has been successful (e.g., animation, sound). After photo capture, embodiment 100 may transition to idle state S1 and display visual cue(s) on or around the control surface of user interface 120, indicating the OFF state of element 121.

State S4 is a start video recording state of embodiment 100. Embodiment 100 can transition from timing start state S2 to start video recording state S4 via activity U24. The state of element 121 remains ON for a period of time equal to or longer than Interval X. User interface activity U24 is exemplified as detecting the user interacting with the control surface of user interface 120 via HOLD action 127 (e.g., continuing PRESS action 124) to Interval X. Starting video recording includes starting capturing an output image signal of image sensor 140 to provide captured video data. Starting video recording to provide the captured video data may also include starting capturing an output audio signal of audio sensor 130. Video processing 170 starts processing the captured video data. Optionally, video processing 170 may store the processed video data into storage 190 for persistence. Optionally, embodiment 100 may provide visual and/or audio cue(s) indicating that video recording has successfully started (e.g., animation, sound).

State S5 is a stop video recording state of embodiment 100. Embodiment 100 can transition from start video recording state S4 to stop video recording state S5 via activity U45. User interface activity U45 is exemplified as detecting the user interacting with the control surface of user interface 120 via a RELEASE action 128 at a time point beyond the end of Interval X. Stopping video recording includes stopping capturing an output image signal of image sensor 140 and may also include stopping capturing an output audio signal of audio sensor 130. Video processing 170 finishes processing the captured video data to provide a video for storage or transmission. Optionally, video processing 170 may store the video into storage 190 for persistence. Optionally, embodiment 100 may provide visual and/or audio cue(s) indicating that video recording has successfully ended (e.g., animation, sound). After video capture, embodiment 100 may transition to idle state S1 and display visual cue(s) on or around the control surface of user interface 120, indicating the OFF state of element 121.

Example Interactions

The following discussion provides non-limiting examples of interactions with embodiment 100. From the perspective of a user, the user utilizes the single user input element 121 in two distinct patterns in order to acquire different representations of visual content.

As a first interaction pattern, the user may employ a relatively short interaction with element 121. For instance, the first interaction pattern may comprise a sequence of a brief touch and a release or a sequence of a brief press and a release. The time duration of the first interaction pattern is shorter than a pre-defined time interval, Interval X. The first interaction pattern triggers still image acquisition (e.g., static picture).

As a second interaction pattern, the user may employ a relatively longer interaction with element 121. For instance, the second interaction pattern may comprise a prolonged touch and a release or a sequence of a prolonged press and a release. The time duration of the second interaction pattern is longer than the pre-defined time interval, Interval X. The second interaction pattern triggers the start of recording a dynamic (audio/video) representation of visual content. The release of element 121 triggers the end of the recording.

Alternative Embodiments

The following discussion provides additional non-limiting embodiments. Another embodiment employs a single-touch camera-mode selection/execution button. The button enables control over the selection and execution of a still camera and a video camera on a single device using a single button. In a first scenario, when the button is pressed and released quickly, the still camera is selected and takes still photo in a photo mode. In a second scenario, when the button is held down, the video camera is selected and takes a video taken for as long as the button is depressed in a video mode. Upon releasing the button, the video camera stops taking the video in the video mode. The photo and video modes are accessed and controlled by user via the same button and can be switched easily by the user using the two aforementioned scenarios.

Yet another embodiment is directed to combination Video Camera/Still Camera devices. Conventional combination Video Camera/Still Camera devices use a switch to toggle back and forth from Video Camera operation mode to Still Camera operation mode. This embodiment employs a single button to automatically switch back and forth between Video Camera operation mode and Still Camera operation mode without the need for a separate switch or toggle. When a user presses and holds the single button for a relatively long period of time (e.g., approximately 1-1.5 seconds or longer), the Video Camera is automatically engaged. As long as the user continues to press/hold the single button, the Video camera is engaged. Upon release of the single button, the operation mode is automatically switched back to Still Camera mode. When a user presses and quickly releases the single button, a photo is taken. The user can press/hold the single button at any time to automatically re-engage the Video Camera mode. The user can switch unlimited times between Still Camera operation mode and Video Camera operation mode by either pressing and holding the single button for Video using the Video Camera mode or by pressing and quickly releasing the single button for Photo using the Still Camera mode.

Although embodiments of this invention have been fully described with reference to the accompanying drawings, it is to be noted that various changes and modifications will become apparent to those skilled in the art. Such changes and modifications are to be understood as being included within the scope of embodiments of this invention as defined by the appended claims. 

What is claimed is:
 1. An apparatus for image capture comprising: one or more image sensors; one or more timing references; a single user input element, a processor block configured to control steps of: engaging the one or more timing references at a reference time R1 based on a user interface activity U1 at the single user input element, the user interface activity U1 occurring at a time T1; performing still image capture with an image sensor of the one or more image sensors in a first case that the one or more timing references indicates a passage of time from the reference time R1 to a reference time R2 is shorter than a time interval X, wherein the reference time R2 is based on a user activity U2 at the single user input element, the user interface activity U2 occurring at a time T2 after the time T1; starting video capture with an image sensor of the one or more image sensors in a second case that the one or more timing references indicates a passage of time since the reference time R1 is equal to or longer than the time interval X; stopping the video capture based on a user interface activity UN at the single user input element after the step of starting video capture in the second case.
 2. The apparatus of claim 1, wherein the reference time R1 is equal to or later than the time T1 and the reference time R2 is equal to or later than the time T1.
 3. The apparatus of claim 1, wherein all of the user interface activities U1, U2, and UN respectively comprise detecting an action based on the same type of physical manipulation of the single user input element.
 4. The apparatus of claim 1, wherein the user interface activity U1 comprises detecting a press action on the single user input element, and wherein each of the user interface activity U2 and the user interface activity UN comprises detecting a release action from the single user input element.
 5. The apparatus of claim 1, wherein the user interface activity U1 comprises a change in the state of the single user input element from OFF to ON, and wherein each of the user interface activity U2 and the user interface activity UN comprises a change in the state of the single user input element from ON to OFF.
 6. The apparatus of claim 1, wherein the image sensor of the step of performing still image capture is the same image sensor of the step of starting video capture.
 7. The apparatus of claim 1, wherein the single user input element is one of a button, a key, a screen area of an input display, and a touch-sensitive area of a touchscreen.
 8. A machine-readable storage medium for an image capture device comprising one or more image sensors, one or more timing references, and a single user input element, the machine-readable storage medium storing instructions that, when executed by one or more processors, cause the image capture device to perform a method comprising: engaging the one or more timing references at a reference time R1 based on a user interface activity U1 at the single user input element, the user interface activity U1 occurring at a time T1; performing still image capture with an image sensor of the one or more image sensors in a first case that the one or more timing references indicates a passage of time from the reference time R1 to a reference time R2 is shorter than a time interval X, wherein the reference time R2 is based on a user activity U2 at the single user input element, the user interface activity U2 occurring at a time T2 after the time T1; starting video capture with an image sensor of the one or more image sensors in a second case that the one or more timing references indicates a passage of time since the reference time R1 is equal to or longer than the time interval X; stopping the video capture based on a user interface activity UN at the single user input element after the step of starting video capture in the second case.
 9. The machine-readable storage medium of claim 8, wherein the reference time R1 is equal to or later than the time T1 and the reference time R2 is equal to or later than the time T1.
 10. The machine-readable storage medium of claim 8, wherein all of the user interface activities U1, U2, and UN respectively comprise detecting an action based on the same type of physical manipulation of the single user input element.
 11. The machine-readable storage medium of claim 8, wherein the user interface activity U1 comprises detecting a press action on the single user input element, and wherein each of the user interface activity U2 and the user interface activity UN comprises detecting a release action from the single user input element.
 12. The machine-readable storage medium of claim 8, wherein the user interface activity U1 comprises a change in the state of the single user input element from OFF to ON, and wherein each of the user interface activity U2 and the user interface activity UN comprises a change in the state of the single user input element from ON to OFF.
 13. The machine-readable storage medium of claim 8, wherein the image sensor of the step of performing still image capture is the same image sensor of the step of starting video capture.
 14. The machine-readable storage medium of claim 8, wherein the single user input element is one of a button, a key, a screen area of an input display, and a touch-sensitive area of a touchscreen.
 15. An apparatus for image capture comprising: one or more image sensors; one or more timing references; a single user input element, a processor block configured to control steps for performing photo capture and to control steps for performing video capture, the steps for performing photo capture comprising: engaging the one or more timing references at a reference time R1 based on a user interface activity U1 at the single user input element, the user interface activity U1 occurring at a time T1; engaging the one or more timing references at a reference time R2 based on a user interface activity U2 at the single user input element, the user interface activity U2 occurring at a time T2 after the time T1; determining a determination D1 that the one or more timing references indicates a passage of time from the reference time R1 to the reference time R2 is shorter than a time interval X; and performing still image capture with an image sensor of the one or more image sensors based on the determination D1; the steps for performing video capture comprising: engaging the one or more timing references at a reference time R3 based on a user interface activity U3 at the single user input element, the user interface activity U3 occurring at a time T3; determining a determination D2 that the one or more timing references indicates a passage of time since the reference time R3 is equal to or longer than the time interval X; starting video capture with an image sensor of the one or more image sensors based on the determination D2; stopping the video capture based on a user interface activity U4 at the single user input element after the step of starting video capture.
 16. The apparatus of claim 15, wherein the reference time R1 is equal to or later than the time T1, the reference time R2 is equal to or later than the time T1, and the reference time R3 is equal to or later than the time T3.
 17. The apparatus of claim 15, wherein all of the user interface activities U1, U2, U3, and U4 respectively comprise detecting an action based on the same type of physical manipulation of the single user input element.
 18. The apparatus of claim 15, wherein each of the user interface activity U1 and the user interface activity U3 comprises detecting a press action on the single user input element, and wherein each of the user interface activity U2 and the user interface activity U4 comprises detecting a release action from the single user input element.
 19. The apparatus of claim 15, wherein each of the user interface activity U1 and the user interface activity U3 comprises a change in the state of the single user input element from OFF to ON, and wherein each of the user interface activity U2 and the user interface activity U4 comprises a change in the state of the single user input element from ON to OFF.
 20. The apparatus of claim 15, wherein the image sensor of the step of performing still image capture is the same image sensor of the step of starting video capture.
 21. The apparatus of claim 15, wherein the single user input element is one of a button, a key, a screen area of an input display, and a touch-sensitive area of a touchscreen.
 22. A machine-readable storage medium for an image capture device comprising one or more image sensors, one or more timing references, and a single user input element, the machine-readable storage medium storing instructions that, when executed by one or more processors, cause the image capture device to perform a method comprising: steps for performing photo capture comprising: engaging the one or more timing references at a reference time R1 based on a user interface activity U1 at the single user input element, the user interface activity U1 occurring at a time T1; engaging the one or more timing references at a reference time R2 based on a user interface activity U2 at the single user input element, the user interface activity U2 occurring at a time T2 after the time T1; determining a determination D1 that the one or more timing references indicates a passage of time from the reference time R1 to the reference time R2 is shorter than a time interval X; and performing still image capture with an image sensor of the one or more image sensors based on the determination D1; steps for performing video capture comprising: engaging the one or more timing references at a reference time R3 based on a user interface activity U3 at the single user input element, the user interface activity U3 occurring at a time T3; determining a determination D2 that the one or more timing references indicates a passage of time since the reference time R3 is equal to or longer than the time interval X; starting video capture with an image sensor of the one or more image sensors based on the determination D2; and stopping the video capture based on a user interface activity U4 at the single user input element after the step of starting video capture.
 23. The machine-readable storage medium of claim 22, wherein the reference time R1 is equal to or later than the time T1, the reference time R2 is equal to or later than the time T1, and the reference time R3 is equal to or later than the time T3.
 24. The machine-readable storage medium of claim 22, wherein all of the user interface activities U1, U2, U3, and U4 respectively comprise detecting an action based on the same type of physical manipulation of the single user input element.
 25. The machine-readable storage medium of claim 22, wherein each of the user interface activity U1 and the user interface activity U3 comprises a press action on the single user input element, and wherein each of the user interface activity U2 and the user interface activity U4 comprises a release action from the single user input element.
 26. The machine-readable storage medium of claim 22, wherein each of the user interface activity U1 and the user interface activity U3 comprises a change in the state of the single user input element from OFF to ON, and wherein each of the user interface activity U2 and the user interface activity U4 comprises a change in the state of the single user input element from ON to OFF.
 27. The machine-readable storage medium of claim 22, wherein the image sensor of the step of performing still image capture is the same image sensor of the step of starting video capture.
 28. The machine-readable storage medium of claim 22, wherein the single user input element is one of a button, a key, a screen area of an input display, and a touch-sensitive area of a touchscreen. 